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Abstract — Space-Time Block Codes (STBCs) suffer from a 
prohibitively high decoding complexity unless the low-complexity 
decodability property is taken into consideration in the STBC 
design. For this purpose, several families of STBCs that involve 
a reduced decoding complexity have been proposed, notably 
the multi-group decodable and the fast decodable (FD) codes. 
Recently, a new family of codes that combines both of these 
families namely the fast group decodable (FGD) codes was 
proposed. In this paper, we propose a new construction scheme 
for rate-1 FGD codes for 2 a transmit antennas. The proposed 
scheme is then applied to the case of four transmit antennas and 
we show that the new rate-1 FGD code has the lowest worst- 
case decoding complexity among existing comparable STBCs. 
The coding gain of the new rate-1 code is optimized through 
constellation stretching and proved to be constant irrespective 
of the underlying QAM constellation prior to normalization. 
Next, we propose a new rate-2 FD STBC by multiplexing two 
of our rate-1 codes by the means of a unitary matrix. Also a 
compromise between rate and complexity is obtained through 
puncturing our rate-2 FD code giving rise to a new rate-3/2 
FD code. The proposed codes are compared to existing codes in 
the literature and simulation results show that our rate-3/2 code 
has a lower average decoding complexity while our rate-2 code 
maintains its lower average decoding complexity in the low SNR 
region. If a time-out sphere decoder is employed, our proposed 
codes outperform existing codes at high SNR region thanks to 
their lower worst-case decoding complexity. 

Index Terms — Space-time block codes, low-complexity decod- 
able codes, conditional detection, non-vanishing determinants. 



I. Introduction 

THE need for low-complexity decodable STBCs is in- 
evitable in the case of high-rate communications over 
MIMO systems employing a number of antennas higher than 
two. This is because despite their low decoding complexity 
that grows only linearly with the size of the underlying 
constellation, orthogonal STBCs suffer from a severe rate 
limitation for more than two antennas (TJ, 0. On the other 
hand, the full-rate alternatives to orthogonal STBCs namely 
the Threaded Algebraic Space-Time (TAST) codes [3| and 
perfect codes J4) have generally a prohibitively high decoding 
complexity. 

The decoding complexity may be evaluated by different 
measures, namely the worst-case decoding complexity mea- 
sure and the average decoding complexity measure. The worst- 
case decoding complexity is defined as the minimum number 
of times an exhaustive search decoder has to compute the 



Maximum Likelihood (ML) metric to optimally estimate the 
transmitted codeword 0, 0, or equivalently the number of 
leaf nodes in the search tree if a sphere decoder is employed, 
whereas the average decoding complexity measure may be nu- 
merically evaluated as the average number of visited nodes by 
a sphere decoder in order to optimally estimate the transmitted 
codeword Q. 

Arguably, the first proposed low-complexity rate-1 code for 
the case of four transmit antennas is the quasi-orthogonal 
(QO)STBC originally proposed by H. lafarkhani [8| and 
later optimized through constellation rotation to provide full- 
diversity lEfl. lHOl. The QOSTBC partially relaxes the orthog- 
onality conditions by allowing two complex symbols to be 
jointly detected. Subsequently, rate-1, full-diversity QOSTBCs 
were proposed for an arbitrary number of transmit antennas 
that subsume the original QOSTBC as a special case ifTTl . 
In this general framework, the quasi-orthogonality stands for 
decoupling the transmitted symbols into two groups of the 
same size. 

However, STBCs with lower decoding complexity may be 
obtained through the concept of multi-group decodability laid 
by S. Karmakar et al. in fl2l . llT3l . Indeed, the multi-group 
decodability generalizes the quasi-orthogonality by allowing 
the codeword of symbols to be decoupled into more than 
two groups not necessarily of the same size. Thanks to 
this approach, one can obtain rate-1, full-diversity 4-group 
decodable STBCs for an arbitrary number of transmit antennas 

m. 

However, due to the strict rate limitation imposed by the 
multi-group decodability, another family of STBCs namely 
fast decodable (FD) STBCs @, Q3], All has been proposed. 
These codes are conditionally multi-group decodable thus 
enabling the use of the conditional detection technique lfT31l 
in which the ML detection is carried out in two steps. The 
first step consists of evaluating the ML estimation of the 
subset of the transmitted symbols which are separable say 
(x\, X2, ■ ■ ■ , Xk), conditioned on a given value of the rest 
of the symbols (xk+i, Xk+2i • - • > %2k) that we may note 
by (xf L ,xf h , . . . ,x k Ah \x k +i,x k +2, ■ ■ ■ ,x 2 k)- In the second 
step, the receiver minimizes the ML metric only over all the 
possible values of (xk+i,Xk+2, ■ ■ ■ ,X2k)- 

Recently, STBCs that are at the time multi-group and fast 
decodable namely the fast group decodable (FGD) codes were 
proposed [17|. These codes are multi-group decodable such 
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that each group of symbols is fast decodable. The contributions 
of this paper are summarized in the following: 

• We propose a novel systematic construction of rate-1 
FGD STBCs for 2 a transmit antennas. The rate-1 FGD 
code for a number of transmit antennas that is not a power 
of two is obtained by removing the appropriate number of 
columns from the rate-1 FGD code corresponding to the 
nearest greater number of antennas that is a power of two 
(e.g. the rate-1 FGD STBC for three transmit antennas 
is obtained by removing a single column from the four 
transmit antennas rate-1 FGD STBC). 

• We apply our new construction method to the case of 
four transmit antennas and show that the resulting new 
4x4 rate-1 code can be decoded at half the worst-case 
decoding complexity of the best known rate-1 STBC. 

• The coding gain of the new 4x4 rate-1 code is optimized 
through constellation stretching [18] and the nonvanish- 
ing determinant (NVD) property |fT9l is proven to be 
achieved by properly choosing the stretching factor. 

• We propose a new rate-2 STBC by multiplexing two 
of the new rate-1 codes by the means of a unitary 
matrix and numerical optimization. Next we achieve a 
significant reduction of the worst-decoding complexity at 
the expense of a rate loss by puncturing the rate-2 code 
to obtain a new rate-3/2 code. It is worth noting that the 
proposed rate-3/2 and rate-2 codes can be viewed as block 
orthogonal STBCs ll20l . 

We provide comparisons in terms of bit error rate (BER) 
performance and average complexity through numerical sim- 
ulations and show that the reduction in worst-case decoding 
complexity comes at the expense of a negligible performance 
loss. We also considered a more practical scenario where 
a time-out sphere decoder [21] is used and show that our 
proposed rate-3/2 and rate-2 codes outperform existing codes 
at high SNR. 

The rest of the paper is organized as follows: The system 
model is defined and the families of low-complexity STBCs 
are outlined in Section II. In Section III, we propose our 
scheme for the rate-1 FGD codes construction for the case 
of 2 a transmit antennas. In Section IV, we address the case of 
four transmit antennas and the coding gain of the new 4x4 
rate-1 STBC is optimized. Next, the rate of the proposed code 
is increased through multiplexing and numerical optimization. 
Numerical results are provided in Section V, and we conclude 
the paper in Section VI. The relevant proofs are provided in 
the Appendices. 

Notations: 

Hereafter, small letters, bold small letters and bold capital 
letters will designate scalars, vectors and matrices, respec- 
tively. If A is a matrix, then A H , A T and det (A) denote 
the Hermitian, the transpose and the determinant of A, re- 
spectively. We define the vec(.) as the operator which, when 
applied toamx n matrix, transforms it into a mn x 1 vector by 
simply concatenating vertically the columns of the correspond- 
ing matrix. The (x) operator is the Kronecker product and S k j is 
the Kronecker delta. The sign(.) operator returns 1 if its scalar 



input is 5= and -1 otherwise. The round(.) operator rounds its 
argument to the nearest integer. The (.) operator when applied 
to a complex vector a returns [5ft (a T ) , Q (a T )] and when 
applied to complex matrix A returns [3? (A T ) ,5(A T )] T . 
For any two integers a and b, a = b (mod n) means that a — b 
is a multiple of n and (m) k means m modulo k. M. n is the 
set ofiixn complex matrices. If A is a finite set, then |„4| 
denotes its cardinality. 

II. Preliminaries 

The baseband MIMO channel input-output relationship may 
be described by: 

Y = X H + W (1) 

TxN r TxN t N t xN r TxN r 

where T is the codeword signalling period, N r is the number 
of receive antennas, N t is the number of transmit antennas, 
Y is the received signal matrix, X is the code matrix, H is 
the channel coefficients matrix with entries hki ~ CM(0, 1), 
and W is the noise matrix with entries Wij ~ £A/"(0, No). 
According to the above model, the i'th row of X denotes 
the symbols transmitted through the Nt transmit antennas 
during the i'th channel use while the n'th column denotes the 
symbols transmitted through the n'th transmit antenna during 
the codeword signalling period T. 

A STBC matrix that encodes 2K real symbols can be 
expressed as a linear combination of the transmitted symbols 
as El : 

2K 

X = ^] A k x k (2) 
k=l 

with Xk e K and the A&, fc = 1, 2K are T x N t complex 
matrices called dispersion or weight matrices that are required 
to be linearly independent over R. A useful manner to express 
the system model can be directly obtained by replacing X in 
(fl} by its expression in (fJJ: 

IK 

Y= (A fe H)x fe + W. (3) 
fc=i 

Applying the vec(.) operator to the above we obtain: 

2K 

vec(Y) = £ (I Nr <g> A fe ) vec (H) x k + vec(W). (4) 

k=l 

where Ijy r is the N r x N r identity matrix. 
If yi, hi and w ; designate the i'th columns of the received 
signal matrix Y, the channel matrix H and the noise matrix 
W, respectively, then equation can be re-written in matrix 
form as: 



yi 






A 2 ^hi 




x% 


+ 


Wx 


YN r _ 




_Aih A r r . . 


• A 2K h Nr 




X2K_ 




_wjv r _ 



y U s w 

(5) 

A real system of equations can be obtained by separating the 
real and imaginary parts of the above to obtain: 

y = Hs + w (6) 



3 



where y,w e R 2N - Txl , and U e M. 2N - Tx2K . Assuming that 
N r T 5= K, the QR decomposition of H yields: 



n = [Qi Qa] 



(7) 



where Qx e R 2N ^ Tx2K ,Q 2 e R 2i\r r Tx(2i\r r T-2fO Q T Qj = 
Sijl, i,j e {1,2}, R is a 2K x 2if real upper triangular 
matrix and is a (2N r T—2K) x 2K null matrix. Accordingly, 
the ML estimate may be expressed as: 

2 



,ML 



arg mm 

seC 



[Qi Q 2 ] T y-[R T O^s 



which reduces to: 



arg min||y' — Rsll 

seC 



(8) 



(9) 



where C is the vector space spanned by information vector s, 
and y' = Qf y. In the following, we will outline the known 
families of low-complexity STBCs and the structure of their 
corresponding R matrices that enable simplified ML detection. 

A. Multi-group decodable codes 

Multi-group decodable STBCs are designed to significantly 
reduce the worst-case decoding complexity by allowing sepa- 
rate detection of disjoint groups of symbols without any loss 
of performance. This is achieved iff the ML metric can be 
expressed as a sum of terms depending on disjoint groups of 
symbols. 

Definition 1. A STBC code that encodes 2K real symbols is 
said to be g-group decodable if its weight matrices are such 
that Sn}, STT}: 

Af A t + Af A fc = 0, VA fc e ft, A ; e ft, 



1 sS i # j «s g, |ft| = m, J m = 2K. 



(10) 



»=i 



where ft denotes the set of weight matrices associated to the 
i 'th group of symbols. 

For instance if a STBC that encodes 2K real symbols is g- 
group decodable, its worst-case decoding complexity order 
can be reduced from M K to Jjf=i ' with M being 
the size of the used square QAM constellation. The worst- 
case decoding complexity order can be further reduced to 
2f=i ' if m e conditional detection via hard slicers 
is employed. In other words, the g-group decodability can be 
seen to split the original tree with 2K — 1 levels to g smaller 
trees each with m — 1 levels. In the special case of orthogonal 
STBCs, the worst-case decoding complexity is 0(1) as the 
PAM slicers need only a fixed number of simple arithmetic 
operations irrespective of the square QAM constellation size. 
The corresponding upper triangular matrix R will be a block 
diagonal matrix: 



R 



Ri 
o 















R„ 



B. Fast decodable codes 

A STBC is said to be fast decodable J5) if it is conditionally 
multi-group decodable. 

Definition 2. A STBC that encodes 2K real symbols is said 
to be FD if its weight matrices are such that: 

,H\. i \H\_ — 0, VAfc e ft, A, eft, 



A; + Af A k 



l^i^j^g, |ft 



K < 2K. 



(11) 



The above definition slightly generalizes the definition of FD 
codes in 1 5 1 where the FD code is restricted to be conditionally 
orthogonal, that is ni = n-i . . . = n g = 1. The advantage of 
FD STBCs is that one can resort to the conditional detection 
to significantly reduce the worst-case decoding complexity. 
For instance, if a STBC that encodes 2K real symbols is 
FD, its corresponding worst-case decoding complexity order 

, — -2K-1 

for square QAM constellations is reduced from \/M to 



M~" " x YjI=i vM * . If the FD code is conditionally 
orthogonal, the worst-case decoding complexity order is re- 
duced to \[~M . Fast decodability reduces to the splitting 
of the last k levels of the real sphere decoder tree into g 
smaller trees each with m — 1, i = l,...,g levels. In the 
particular case of n, = 1, Vi = l,...,g (i.e. the FD code is 
in fact conditionally orthogonal), fast decodability reduces to 
the removal of the last k levels of the real valued tree. The 
upper triangular matrix R takes the following form: 

A B 
C 



R 



where B has no special structure, C is a (2K — k) x (2K — k) 
upper triangular matrix, and A is a block diagonal n x k 
matrix: 

Ri ... 
R 2 ... 



A 











R„ 



with Ri being an, x m upper triangular matrix. 

C. Fast group decodable codes 

A STBC is said to be fast group decodable ITF71 if it is 
multi-group decodable such that each group is fast decodable. 

Definition 3. A STBC that encodes 2K real symbols is said 
to be FGD if: 

Af A; + Af A fe = 0, VA fc e ft, A ; e ft, 

9 (12) 
1 < i ¥= j < g, |ft| = rii, 2j n i = 2if 

i=l 

and that the weight matrices within each group are such that: 
Af A z + Af A fc = 0, VA fe e ft, m , A ; e ft,„, 

(13) 



Um^n<ffi, \Q, 



1,3 I _ 



s 



where R^ is a n, x n, upper triangular matrix. 



where ft m (resp. gi) denotes the set of weight matrices that 
constitute the m'th group (resp. the number of inner groups) 
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within the i'th group of symbols Qi. For instance, if a STBC 
that encodes 2K real symbols is FGD, its corresponding 
worst-case decoding complexity order for square QAM con- 
stellations is reduced from \[M to £]f=i V^Vf * * x 
SfLi VM *' 3 . Similarly, if each group is conditionally 
orthogonal, the worst-case decoding complexity order is equal 
to Xif=i * \ If a real sphere decoder is employed, fast 
group decodability reduces to splitting the original 2K — 1 
real valued tree into g smaller trees each with n,; levels, and 
that for each of these new trees the last k% levels are split into 
gi smaller trees each with n^j — 1 levels. The corresponding 
upper triangular matrix R takes the following form: 



R 



Ri 
o 





R 2 





such that for 1 < i < g, one has: 

[A,; 






R„ 



(14) 



R, 







B, 

Ci 



(15) 



where Bj has no special structure, Cj is a (m — k») x 
(rii — fy) upper triangular matrix, and A* is a fc, x block 
diagonal matrix: 



Rii o 

R,, 2 
















R, 



(16) 



where Rjj being a riij x rijj upper triangular matrix 

III. The proposed FGD scheme 

In this section, we provide a systematic approach for 
constructing rate-1 FGD STBCs for 2 a transmit antennas. 
Recognizing the key role of the matrix representations of the 
generators of the Clifford Algebra over R [2| in the sequel 
of this paper, we briefly review their main properties in the 
following. 

A. Linear Representations of Clifford generators 

The Clifford algebra over K is generated by the generators 
7,: satisfying the following property: 

7i7j + Jjli = -2<%1 (17) 
The above equation can be split in two equations: 

ll = -1 (18) 

Hlj = -IjH v * ^ 3- (I 9 ) 

In J2l, the question about the maximum number of unitary 
representations of the Clifford generators has been thoroughly 
addressed and it has been proven that for 2 a x 2 a matrices 
there are exactly 2a + 1 unitary matrix representations of the 



Clifford algebra generators. The matrix representations of 7* 
denoted lZ(ji) for the 2 a x 2 a case are obtained as 0: 

7^(7i) = ±3 °3 ® 03 • ■ • <8> 03 

^ -v. J 

a 

^•(72) = I 2 o-i(g)(Tl 
^•(73) = I 2 »-i®CT2 

^(72/c) = l 2 °-* ® °~i ®q~3 g> 3 ■ ■ ■ g 03 

fc-1 

^•(72fc+l) = I 2 °~fc ® 02 ®CT 3 (g) g 3 ■ . ■ (X) 0-3 

fe-1 



T^{l2a) = 01 <g> 03 ® 3 ■ ■ ■ 03 

a-1 

^(72a + l) = CT 2 p"3 ® Q - 3 • ■ • Q~3 

a-1 



where 





" 1" 




"0 


3 




"1 " 


01 = 


-1 


> 02 = 


3 





, 03 = 


-1 



(20) 



(21) 



From now on, we will denote TZ(%) by R^ for simplicity. 
The properties of the matrices R,, i = 1, . . . , 2a + 1 can be 
summarized as follows: 

Rf = -m, R? = -I, VI < i < 2o + 1, 
and RiRj + R^-Ri = 0, VI < i ^ j < 2a + 1. 



B. A systematic approach for the construction of FGD codes 

Proposition 1. For 2 a transmit antennas, the two sets of 
matrices, namely Q\ = {I, Ri, . . . , R a } u A and Q2 = 
{R a+ i, . . . , R2a+i} u B satisfy (110b where A and B are given 
in Tableland Table [77] respectively, and 5^t(m),5g(m) are 
given in Table 1/7/1 



TABLE III 
Different cases for 5 



a 


6A(m) 


$b(to) 


4n 


(H 4 -i)(("i 


>4"2) 


2-(m) 4 


2 


2 


4n + 1 


((m) 4 ((m) 4 - 


!)) 4 


(m) 4 -l 


2 


2 


4n + 2 


((m) 4 ((m) 4 - 




(™) 4 
2 


An + 3 


(((m) 4 -2)((m) 


4-3)) 4 


3-(m) 4 


2 


2 



See Appendix A for proof. 

Proposition 2. The rate of the proposed FGD codes is equal 
to one complex symbol per channel use regardless of the 
number of transmit antennas. 

See Appendix B for proof. 
Examples of the rate-1 FGD codes for 4,8 and 16 transmit 
antennas are given in Table [IV] 
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TABLE I 
Different cases for A 



a 


A 


An 


{ 




J m = l 




1 


An + 1 






m=l I 


jSM m) n ^J a l 1 R, n ™ ! : 1 < fci < . . . < k m < a} 




An + 2 






m — 1 1 


/^ (m) rK+i R, n™ i R fci : 1 < ki < . . . < k m < a] 




An + 3 


{ 




) m — 1 


{f Am) Uttli R» n™ i R^ ■ 1 < fci< • ■ • < k m < a 


i 



TABLE II 
Different cases for B 



a 


B 


4n 


{J nti R*l m ^ 2 2 4 {/ s(m) nti R * Uti Rfe. : « + 1 *S k x < . . . < k m < 2a + 1} 


4n + 1 


a G 2 {j 5 ^™) nti R* 11" i R*, : a + 1 < h < . . . < k m < 2a + 1} 

m= 1 ,3 


An + 2 


inti R«i „5 2 2 4 {j 5Birn) nti R * n™ i r** = « + 1 < h < . . . < k m < 2a + 1} 


An + 3 


°0 2 nti R* Ylti Rfe, : a + 1 < fci < • • • < fcm < 2a + 1} 

m— 1,3 



TABLE IV 
Examples of rate-1 FGD codes 



Tx 


Gi 


G2 


4 


I, R2, R4, R1R3R5 


Ri , R3 , R5 , R2R4 


8 


I, R2, R4, R6 

JR1R3R5R7 
JR1R3R5R7R2 
JR1R3R5R7R4 
JR1R3R5R7R6 


Ri, R3, R5, R7 
J/R2R4R6R1 
JR2R4R6R3 
J/R2R4R-6R5 
JR2R4R6R7 


16 


I, R2 , R4 , R-6 ! Rs 
JR1R3R5R7R9 
R1R3R5 R7R9R2 
R1R3R5R7R9R4 

R1R3R5R7R9R6 
R1R3R5R7R9R8 
Ri R3R5R7R9R2R4 
R1R3R5R7R9R2R6, 
R1R3R5R7R9R2R8 

RlR3R5R7RgR4R6, 
R1R3R5R7R9R4R8 

R1R3R5R7R9R6R8 


Ri , R3 , R5 1 R7 , R9 

JR2R4R6R8 
R2R4R-6R8R1R3 
R2 R4R6R8R1R5 
R2R4R6R8R1R7 
R2 R4R6 RsRiRg 
R2 R4 R6 Rs R3 R5 

R2R4R6R8R3R7 
R2 R4 R6 R8 R3 R9 

R2 R4 R6 R8 R5 R7 
R2 R4 R-6 R8 R5 R9 
R2R4R6R8R7R9 



IV. The four transmit antennas case 

In this section, a special attention is given to the case of four 
transmit antennas. The proposed rate-1 FGD code arises as the 
direct application of Proposition 1. in the case of four transmit 
antennas. Setting a = 2, we have Q\ = {I, R2, R4, R1R3R5} 
and Q2 = {Ri, R3, R5, R2R4}- It is worth noting that the 
above choice of weight matrices guarantees an equal average 
transmitted power per transmit antenna per time slot. 



A. A new rate-1 FGD STBC for four transmit antennas 
The Proposed rate-1 STBC denoted Xi is expressed as: 

Xi(s) =Ixi + R2^2 + R4^3 + RlR3R5^4 + 
R1X5 + R3X6 + R5X7 + R2R4^8- 

According to Definition 3., the proposed code Xi is FGD 
with g = 2, rii = ri2 = 4 and g\ = gi = 3 such as m t j = 
1, i = 1,2, j = 1,2,3. Therefore, the worst-case decoding 
complexity order is 2\f~M. However, the coding gain of Xi is 
equal to zero, and in order to achieve full-diversity, we resort 
to the constellation stretching [ 18 1 rather than the constellation 
rotation technique, otherwise the symbols inside each group 
will be entangled together which in turns will destroy the FGD 
structure of the proposed code and causes a significant increase 
in the decoding complexity. 

The full diversity code matrix takes the form of (l24t where 
s = [xi, . . . , x$] and k is chosen to provide a high coding 
gain. The term y j^jp is added to normalize the average 
transmitted power per antenna per time slot. 

Proposition 3. Taking k = y§> ensures the NVD property 
for the proposed code with a coding gain equal to 1. 

See Appendix C for proof. 
For illustration purposes, a comparison between regular and 
stretched 16-QAM constellation points is depicted in Fig Q] 
where the dark dots denote the regular 16-QAM constellation 
points whereas the red squares denote the stretched 16-QAM 
constellation points with stretching factor k = y|. 

B. The proposed rate-2 code 

The proposed rate-2 code denoted X2 is simply obtained by 
multiplexing two rate-1 codes by means of a unitary matrix. 
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Xi(s) 



1 + k 2 



x\ + ikx§ X2 + ikxe X3 + ikx^ —ikx^ — 

— x 2 + ikx 6 X\ — ikx§ —ikx^ — x% — X3 — ikx-j 

—x 3 + ikxf ikx^ + x 8 Xi — ikx§ X2 + ikx^ 

ikx4 + x$ 23 — ikx-j —X2 + ikxe x% + ikx$ 



(24) 



Q 



Fig. 1. regular (•) versus stretched (■) 16-QAM constellation points 

Mathematically speaking, the rate-2 STBC is expressed as: 

X 2 (xi, . . . ,xi 6 ) = Xi (xi, . . . , x 8 ) + eP^Xi (x 9 , . . . ,x 16 ) U 

(25) 

where U and <f> are chosen in order to maximize the coding 
gain. It was numerically verified for QPSK constellation that 



maximizes the 



taking U opt = jRi and </> opt = tan 
coding gain which is equal to 1. 

To decode the proposed code, the receiver evaluates the 
QR decomposition of the real equivalent channel matrix % 
©. Thanks to the FD structure of the proposed rate-2 code 
with K = 8, k = 8, n\ = ni = 4, the corresponding upper- 
triangular matrix R takes the form: 



R 



A B 
C 



(26) 



p8x8 



where B e 
upper triangular matrix and A e 



has no special structure, C e 



p8x8 



takes the form: 



X 








X 

















X 





X 




















X 


X 























X 


























X 








X 

















X 





X 




















X 


X 























X 



(27) 



where x indicates a possible non-zero position. For each value 
of (29, . . . , Xiq), the decoder scans independently all possible 
values of X4 and x&, and assigns to them the corresponding 6 
ML estimates of the rest of symbols via hard slicers according 
to equations (l28l i and (|29T > at the top of the next page where rij 
denote the entries of the upper-triangular matrix R. Therefore, 



instead of evaluating the ML metric M 8 times, the sphere 
decoder needs only 2M 4 5 ML metric evaluations. 

A rate-3/2 code that will be denoted X 3 / 2 may be easily 
obtained by puncturing the rate-2 proposed code in (l25t , and 
may be expressed as: 

X3/2 (xi, . . . , X12) =Xi (xi, . . . , x 8 ) + ^ 
e i ^X 1 (x 9 ,...,Xi2)Uop, 



V. Numerical and simulation results 

In this section, we compare our proposed codes to similar 
low-complexity STBCs existing in the literature in terms of 
worst-case decoding complexity, Peak-to- Average Power Ratio 
(PAPR), average decoding complexity, coding gain, and bit 
error rate (BER) performance over quasi-static Rayleigh fading 
channels. One can notice from Table [V] that the worst-case 
decoding complexity of the proposed rate-1 code is half that 
of ll24l and l23l . The worst-case decoding complexity of the 
proposed rate-3/2 code is half that of the punctured rate-3/2 
PSrinath-S.Rajan code |26| and is smaller by a factor of 
y/M/2 than the rate-3/2 S.Sirianunpiboon et al. code l25ll . 
The worst-case decoding complexity of our rate-2 code is half 
that of the rate-2 PSrinath-S.Rajan code ||261 and smaller by 
a factor of VAf/2 than the rate-2 TP Ren code et al. ||27l . 

Simulations are carried out in quasi-static Rayleigh fading 
channel in the presence of AWGN for 2 receive antennas. The 
ML detection is performed via a depth-first tree traversal with 
infinite initial radius sphere decoder. The radius is updated 
whenever a leaf node is reached and sibling nodes are visited 
according to the simplified Schnorr-Euchner enumeration [28]. 

From Fig. [2] one can notice that the proposed rate-1 code 
loses about 0.6 dB w.r.t to Md.Khan-S.Rajan rate-1 code J24l 
while offering similar performance to M.Sinnokrot-J. Barry 
rate-1 code ll23l at 1CP 3 BER for several spectral efficiencies 
namely 2,4, and 6 bpcu. 

From Fig. [3] one can notice that the proposed rate-3/2 code 
loses about 0.6 dB w.r.t the punctured PSrinath-S.Rajan code 
[26 1 while it gains about 0.4 dB w.r.t S.Sirianunpiboon et al. 
code [25 1 at 10~ 3 BER for several spectral efficiencies namely 
3,6, and 9 bpcu. Moreover, from Figs |4(a)] |4(b)| and |4(c)| one 
can easily verify that the proposed rate-3/2 code maintains its 
lower average decoding complexity for the considered spectral 
efficiencies while the average decoding complexity of the 
S.Sirianunpiboon et al. code increases with the size of the 
underlying constellation. 

From Fig. [5] one can notice that the proposed rate-2 code 
loses about 0.8 dB w.r.t the PSrinath-S.Rajan code [26| while 
gaining about 0.25 dB w.r.t TP. Ren et al. code (23 at 10~ 3 
BER. From Fig. [6] it is easily noticed that our proposed 
code is decoded with lower average decoding complexity 



where: 



xf L | (x4, X9, . . . , ii6) = sign (m) x min|j2 round( {z% — l)/2) + l|, s/M — lj, i = 1,2,3 
xf L \ {x 8 ,x 9 , . . . ,x 16 ) = siga(zj) x min[j2 round ( t- l)/2) + l|, VM - lj, j = 5,6,7 
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ViTiAX A 



2 n.fe^fe /n,t) « = 1,2,3, 



fc=9 



' 16 \ 

u'j - r jfi x 8 - Yj r hkXk J /r-jj, j = 5, 6, 7 

V fc=9 / 



(28) 
(29) 



TABLE V 

SUMMARY OF COMPARISON IN TERMS OF WORST-CASE COMPLEXITY, MlN DET AND PAPR 



Code 


Worst-case 


Min det (= s/S) 


PAPR (dB) 




complexity 


for QAM constellations 


QPSK 


16QAM 


64QAM 


The proposed rate-1 code 




1 





2.5 


3.7 


M.Sinnokrot-J. Barry code [ 23 ] 


Wm 


7.11 





2.5 


3.7 


Md.Khan-S.Rajan code [24 1 




12.8 


5.8 


8.3 


9.5 


The proposed rate-3/2 code 


2M' z - b 


1 (verified for 4-QAM) 


3 


5.6 


6.7 


S.Sirianunpiboon et al. code |25| 


M A 


N/A 


5.4 


8 


8.4 


P.Srinath-S.Rajan rate-3/2 code |26| 


AM' 2A > 


12.8 


4 


6.5 


7.7 


The proposed rate-2 code 


2Af 4 - 5 


1 (verified for 4-QAM) 


2.8 


5.3 


6.5 


P.Srinath-S.Rajan code |26] 


4M 4J 


12.8 (verified for 4/16-QAM) 


2.8 


5.3 


6.5 


Tian Peng Ren et al. rate-2 code |27 | 


M b 


N/A 


4 


7 


7.66 
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Fig. 2. BER performance for 4x2 system 



Fig. 3. BER performance for 4x2 system 



than T.P. Ren et al. code [27] over the entire SNR range 
while it maintains its lower average decoding complexity w.r.t 
P.Srinath-S.Rajan code [26 1 in the low SNR region. 

Next, we considered a more practical scenario where a time- 
out sphere decoder lETI is employed. In fact, the tree-based 
search is terminated if a predetermined limit on the number of 
visited nodes is exceeded and the sphere decoder returns the 
current codeword estimation. In Fig. |7]we fixed a threshold of 
50, 500, 5000 nodes count at 3, 6, and 9 spectral efficiencies 
respectively. It can be verified that our rate-3/2 proposed code 
outperforms the punctured P.Srinath-S.Rajan code [26 1 and the 
S.Sirianunpiboon et al. code [25 1 at high SNR for 3 and 6 bpcu 
spectral efficiencies. In Fig. [S] the threshold is fixed at 1000 
nodes count. One can easily verify that the proposed code 
outperforms the P.Srinath-S.Rajan code [26 1 and the T.P. Ren 
et al. code ll27ll code at high SNR. This can be justified by 



the fact that the maximum number of visited nodes is related 
to the worst-case decoding complexity which is lower in our 
proposed codes. 

VI. Conclusions 

In the present paper we have proposed a systematic ap- 
proach for the construction of rate-1 FGD codes for an 
arbitrary number of transmit antennas. This approach when 
applied to the special case of four transmit antennas results in 
a new rate-1 FGD STBC that has the smallest worst-case de- 
coding complexity among existing comparable low-complexity 
STBCs. The coding gain of the proposed FGD rate-1 code 
was then optimized through constellation stretching. Next we 
managed to increase the rate to 2 by multiplexing two rate- 
1 codes through a unitary matrix. A compromise between 



2 4 6 8 10 12 

SNR per receive antenna 

(a) Average complexity for 4x2 system at 3 bpcu 



i] 




II 



I) 

























a P.Srinath-S.Rajan rate-3/2 code 
S.Sirianunpiboon et al, code 
Proposed rale- 3/2 code 





















































































































































































2 4 6 8 10 12 14 16 18 20 22 24 
SNR per receive antenna 

(b) Average complexity for 4x2 system at 6 bpcu 
■10 J 





























S.SirianunpLboon et al code 
I'.SriiiLilli-S.RLijun rate- 3/2 code 
Proposed rate- 3/2 code 























































































































































































































2 4 C 8 10 12 14 16 18 20 22 24 2fi 28 
SNR per receive antenna 

(c) Average complexity for 4x2 system at 9 bpcu 
Average complexity for 4x2 system at 3, 6 and 9 bpcu 



• T.P.Ren et al rate-2 code 
■ Proposed rate-2 code 
> P.Srinath-S.Rajan rate-2 code 



6 8 10 12 

SNR per receive antenna 



14 16 



Fig. 5. BER performance for 4x2 system 





































































































































— ■— T.P.Ren el al rate-2 code 

P.Srinath-S.Rajan rate-2 code 
— Proposed rate-2 code 

























u 600 

T3 

o 
ti 

| 500 

"> 

1 400 
£ 
| 

t 300 

— 

•S 200 
100 

°0 2 4 6 8 10 12 

SNR per receive antenna 

Fig. 6. Average complexity for 4x2 system 




10 12 14 16 18 20 22 24 26 28 30 
SNR per receive antenna 



Fig. 7. BER performance for 4x2 system with timeout sphere decoder 



9 




have: 



. H 



moreover, it is easy to prove that: 

m odd I e {ki, . 
m even I $ {ki, . 

AR, 



AR, 



m even I e {fci, 
m odd I $ {kx, 



(31) 



(32) 



It is easy to see that Qi\Au G?\B is the set of weight matrices 
for the Complex Orthogonal Design (COD) for 2 a transmit 
antennas. Thus we need only to prove that the Q\, and Q2 
satisfy ([ToJ. Towards this end let A fe 6 A, B; 6 B, C e Qi\A, 
and D e Qo\B. Let a = 4n, we have according to Table [TTJ 



6 8 10 12 

SNR per receive antenna 



16 



Bi = ifjRi 



(33) 



»=i 



Fig. 8. BER performance for 4x2 system with time out sphere decoder and B m / 2+ i is given by d34"l > at the top of the next page. 



complexity and throughput may be achieved through punc- 
turing the proposed rate-2 code which results in a new low- 
complexity rate-3/2 code. The worst-case decoding complexity 
of the proposed codes is lower than their STBC counterparts 
in the literature. 

According to the simulations results, the proposed rate- 
1 code loses about 0.6 dB w.r.t to Md.Khan-S.Rajan rate-1 
code [24] while offering similar performance to M.Sinnokrot- 
J. Barry rate-1 code lf23l at 10 -3 BER for several spectral 
efficiencies namely 2,4, and 6 bpcu. The proposed rate-3/2 
code loses about 0.6 dB w.r.t the punctured P.Srinath-S.Rajan 
code |26| while it gains about 0.4 dB w.r.t S.Sirianunpiboon 
et al. code iTZBI at 10~ 3 BER for several spectral efficiencies 
namely 3,6, and 9 bpcu. Moreover, the proposed rate-3/2 
code maintains its lower average decoding complexity for the 
considered spectral efficiencies. 

The proposed rate-2 code loses about 0.8 dB w.r.t the 
P.Srinath-S.Rajan code [26 1 while gaining about 0.25 dB w.r.t 
TP. Ren et al. code 11271 at 10~ 3 BER . Our proposed code is 
decoded with lower average decoding comlexity than TP Ren 
et al. code ll27l over the entire SNR range while it maintains 
its lower average decoding complexity w.r.t P.Srinath-S.Rajan 
code 1 26 1 in the low SNR region. 

Next, we considered a more practical scenario where a time- 
out sphere decoder is employed. Our rate-3/2 proposed code 
outperforms the punctured P.Srinath-S.Rajan code [26] and the 
S.Sirianunpiboon et al. code [25 1 at high SNR for 3 and 6 bpcu 
spectral efficiencies. The proposed rate-2 code outperforms the 
P.Srinath-S.Rajan code [26| and the TP Ren et al. code l27l 
code at high SNR. This can be justified by the fact that the 
maximum number of visited nodes is related to the worst-case 
decoding complexity which is lower in our proposed codes 

Appendix A 

Proof: From the properties of the matrix representa- 
tions of the Clifford algebra generators over R (1221) . it is 
straightforward to prove that for a matrix A e if 



n,=iR* : 1 < fci< 



< k m ^ 2a + 1, then we 



m/2+1 

Consequently, from (fSTJ: 

B H = B, VB e B 
and from d32l one has: 

B H C + C H B = 0, VB e B. 
On the other hand from Table J] we have: 



(35) 



(36) 



and A m is given by 
d3B . it follows that: 

Af 

A H 



=j n r * (3?) 

i — a+1 

at the top of the next page. From 



Ai 



A m 
A 



ra even 
m odd 



(39) 
(40) 



and from ( 1321 . we obtain: 

A H B + T> H A = 0, VA e A. (41) 

Finally, from Eqs d32j, (|33j, (El, (T37}, and (fJUlwe get: 

A H B + B H A = 0, VA e A, B e B. (42) 

It remains to prove that the proposed weight matrices are 
linearly independent over R. Towards this end recall from 
[29 1 that if {M^ : k = 1, . . . , 2a} are pairwise anti-commuting 
matrices that square to a scalar, then the set: 



Bo 




r, 1 2a 

, 2a\ u 

rn=2 



< k m s$ 2a 



forms a basis of M.^ over C. Consequently, the set £>2<> u j£>2o 
forms a basis over M. Thanks to the properties of the matrix 
representations of Clifford algebra generators (f22l . the set of 
matrices defined in d43l forms a basis of M.ia over R: 



{I,jI}u{R fc) jR fc :fc=l, 



1 1 2a 
. . , 2a\ u 

m=2 



f] R fci , jf] R fci : 1< h < . . . < km ■ 2a 



(43) 
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B 



'm/2+1 



jrc=i^nr=iRfc, a+i< **<... . ,. 
nr=i R in™iRfe, a+i^kK ...<k 



n2a+l tj i-rm tj 

n2a+l -rj i-rm -rj 
*=a + l K * 1 li=l K fci 



i ^ fci < 

1 s$ ki < 
1< fei < 

1 fei < 



< fc m < 2a + 1, m = An, n' ^ 
'; m < 2a+ 1, m = 4n' + 2 



< fc m < a, m = 4n', n' ¥= 

< a, m = An' + 1, ^ 

^ a, m = An' + 2 

^ a, m = An' + 3 



(34) 



(38) 



But from ( 120) , one has: 

2a+l 

which reduces for the case of a = An to: 

2a+l 



II Ri = ±j'Ia«. 



(44) 



(45) 



In what follows, we will express the proposed weight matrices 
in terms of Rj, i = 1, 2, . . . , 2a thanks to 1451 . Readily, the set 

£ 2 \£ becomes |R a+ i, . . . , R 2a , ±j Y\a=i After some 
manipulations, the sets A in Table J] and £> in Table [XT] may 
be re-written as in (l46l and (l47l i respectively at the top of 
the next page. Now it can be easily verified that Q\ u Q2 is 
a subset of the basis in (143V The proofs for other cases of a 
follow similarly and are therefore omitted. ■ 

Appendix B 

Proof: The rate of the proposed FGD codes for the case 
of 2 a transmit antennas may be expressed as: 

2a + 2 + 1.4| + |Bl 

2a+l x ' 

However form Table [I] regardless of a we have: 

a— 2 / \ a—2 / \ 

On the other hand, from Table [XT] we have for a even: 



a— 2 / , 1 \ a—2 / 

w - ,+ X(T)- ,+ J_G^ 



+ 



a— o / \ a— 

2 : + s 



j=l,3... VJ/ i=2,4... 
a—2 / \ a-2 



a—z / \ a— z / \ 



where we used the recursion identity: 



(a + 1) 



n-l\ /ra-1 
fc - 1 J + I fc 



(50) 



(51) 



Similarly, for a odd, we have: 

a—2 / i i\ a — 2 



i=l,3,. 
a— 3 



i=1.3. 



s 



3=0,2 
a-2 



a-2 



+ 



2 " = 2 



a 

i- 1 

a-2 



(52) 



2 " =2 a -(a + l). 



(53) 



Finally, using these relations, we get: 

R=l. 

Thus concluding the proof. ■ 
Appendix C 

Proof: The proposed code is 2-group decodable 
and the corresponding two sub-codes will be 
denoted by Xi = X(:Ei, £2, £3, X4, 0, 0, 0, 0) and 
Xn = X(0, 0, 0, 0, X5, xq, xr, xg) to avoid any ambiguity. 
The coding gain <5x is equal to the minimum Coding Gain 
Distance (CGD) |30l, or mathematically: 

S x = min detf(X(s)-X(s')) W (X(s)-X(s')) 

s,s'eC 



CGD(X(s),X(s')) 

min |det((X(As)))| 2 

AseAC\{0} 



(54) 



where As = s — s', AC is the vector space spanned by As. 
Thanks to the quasi-orthogonality structure one has [31 : 

5 X = min{<5 X i,£xii} ■ (55) 
The coding gain of the first sub-code is expressed as: 



(A~ 2 



I + Ax\ + Ax\ - k 2 Axl) ( - 



Choosing k 



+ k 2 



, the above expression becomes: 



(56) 



(5Ax\ + 5Axl + 5Axj - 3Aa 



1 + 



(57) 

where Axi = 2ni, n% e Z. Consider the Diophantine 
quadratic equation below: 

5(X 2 + X 2 + X 2 ) - 3Xl X 1 ,X 2 ,X 3 , X 4 eZ (58) 

In order to find a solution we resort to the following theorem 
133: 
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m = 2 



1 Yl Rfc< : 1 s$ ki < . . . < k m sS 



(46) 



»=i 



a-2 
m=2,4 



m=3,o 



a rn ) 

«B(m) j-j Ri j-j Rfc . a + x ^ fci < < fcm ^ 2a \ 

i=l i=l J 

(a+i-m) + i F| Rfc :a + 1 < &i < ... <fc m < 2a 



(47) 



Theorem 1. 77ie equation 



a\x\ + a^x 2 , + a^x 2 + 04X4 



0. 



(59) 



is solvable in rational integers iff the coefficients 
Oj, i = 1, . . . , 4 are smc/i that: 

if 01020304 ee 1 {mod 8), f/zen we require 
Oil + 0,2 + 03 + 04 ee (mot/ 8). There are no 
conditions if 01020304 = 2,3,5,6,7 (moc/ 8). In the 
case 01020304 = 4 (moc/ 8) ant/ ai = 02 = (mod 2) then 
if ^01020304 = 1 (mod 8) ;f is required that: 



1 



1 



1 



,2„2 



-ai + -a 2 + a 3 + a 4 = - (0304 

Afo conditions are required if: 
1 



l) (mot/ 



-a 1 02a304 = 3, 5, 7 (mod . 



(60) 



(61) 



Consequently equation ( l58l l equals to iff Xi = X2 = X3 = 
X4 = 0, this follows directly from the above theorem as — 3 x 
5x5x5=1 (mod 8) with 5 + 5 + 5- 3= 12 = ±A (mod 8). 
Moreover, one has: 

5(Xf + Xf + Xf)-3Xf^ +1. (62) 



Otherwise, we must have 

r2 



3X| ee +1 (mod 5) 



(63) 



which cannot be true, since the quadratic residues modulo 
5 are 0,1 and 4 (33], thus 3Xf ee 0,+3 or + 2 (mod 5). 
Therefore, we can write: 

h(Xl+Xt+Xj)-ZXl\ > 2, V(X 1 ,X 2 ,X 3 ,X A ) * (64) 

The above equality holds for many cases, take for instance 
X\ = X 2 = l,X 3 = Xi = 0. It is worth noting that the 
numerator of the expression (l57t is a special case of the Dio- 
phantine equation in d58l > as Axi = 2rii, rii e Z, i = 1, 2, 3, 4. 
Therefore thanks to the above inequality one has: 

■ 2 4 



3 X! 



(8/5)* 



(8/sr 



16. 



(65) 



The coding gain of the second sub-code is expressed as: 



For k 



(k 2 Ax 2 5 + k 2 Ax 2 6 + k 2 Ax 2 7 - Ax 2 8 

g, the above expression becomes: 
(3Ax§ + 3Axl + 3Ax?. - 5Axf) 



1 + k 2 



(66) 



1 + f 



Consider the Diophantine quadratic equation below: 

3(Xl + X 2 + X 2 ) - 5X 2 , X 5 , X 6 ,X 7 , X 8 eZ (68) 

It is easy to verify from Theorem 1. that the above equation 
equals iff X 5 = X 6 = X 7 = X 8 = as -3 x 3 x 3 x 5 ee 
1 (mod 8) with 3 + 3 + 3- 5 = 4=. +4 (mod 8). 
However, we have: 

3(Xl+Xl+X 2 )-bXl\ > 1, \/(X 5l X 6 ,X 7 ,X s ) * (69) 

The above inequality holds for instance by taking X 5 = 
0, X e = X 7 = X 8 = 1. By noting that the nominator in 
expression ( f67b is a special case of the Diophantine equation 
(l68l l as Axi = 2rij, n, e Z, i = 5, 6, 7, 8, then thanks to the 
above inequality we have: 

^4 2 4 



(4/5)* 



(8/5) 4 



1 



(70) 



and thus <5x 
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